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Abstract 

High frequency data in finance have led to a deeper understanding on probabihty 
distributions of market prices. Several facts seem to be well stablished by empirical 
evidence. Specifically, probability distributions have the following properties: (i) 
They are not Gaussian and their center is well adjusted by Levy distributions, (ii) 
They are long-tailed but have finite moments of any order, (iii) They are self-similar 
on many time scales. Finally, (iv) at small time scales, price volatility follows a non- 
diffusive behavior. We extend Merton's ideas on speculative price formation and 
present a dynamical model resulting in a characteristic function that explains in 
a natural way all of the above features. The knowledge of such distribution opens 
a new and useful way of quantifying financial risk. The results of the model agree 
— with high degree of accuracy — with empirical data taken from historical records 
of the Standard & Poor's 500 cash index. 



1 Introduction 



One of the most important problems in mathematical finance is to know the 
probability distribution of speculative prices. In spite of its importance for 
both theoretical and practical applications the problem is yet unsolved. The 
first approach to the problem was given by Bachelier in 1900 when he mod- 
elled price dynamics as an ordinary random walk where prices can go up and 
down due to a variety of many independent random causes. Consequently 
the distribution of prices was Gaussian [1]. The normal distribution is ubiq- 
uitous in all branches of natural and social sciences and this is basically due 
to the Central Limit Theorem: the sum of independent, or weakly dependent, 
random disturbances, all of them with finite variance, results in a Gaussian 
random variable. Gaussian models are thus widely used in finance although, 
as Kendall first noticed [2], the normal distribution does not fit financial data 
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specially at the wings of the distribution. Thus, for instance, the probability 
of events corresponding to 5 or more standard deviations is around 10"^ times 
larger than the one predicted by the Gaussian distribution, in other words, the 
empirical distributions of prices are highly leptokurtic. Is the existence of too 
many of such events, the so called outliers, the reason for the existence of "fat 
tails" and the uselessness of the normal density specially at the wings of the 
distribution. Needless to say that the tails of the price distributions are crucial 
in the analysis of financial risk. Therefore, obtaining a reliable distribution has 
deep consequences from a practical point of view [3,4]. 

One of the first attempts to explain the appearance of long tails in financial 
data was taken by Mandelbrot in 1963 [5] who, based on Pareto-Levy stable 
laws [6], obtained a leptokurtic distribution. Nevertheless, the price to pay is 
high: the resulting probability density function has no finite moments, except 
the first one. This is indeed a severe limitation and it is not surprising since 
Mandelbrot's approach can still be considered within the framework of the 
Central Limit Theorem, that is, the sum of independent random disturbances 
of infinite variance results in the Levy distribution which has infinite vari- 
ance [6]. On the other hand, the Levy distribution has been tested against 
data in a great variety of situations, always with the same result: the tails of 
the distribution are far too long compared with actual data. In any case, as 
Mantegna and Stanley have recently shown [7], the Levy distribution fits very 
well the center of empirical distributions — much better than the Gaussian 
density — and it also shares the scaling behavior shown in data [7-10]. 

Therefore, if we want to explain speculative price dynamics as a sum of weakly 
interdependent random disturbances, we are confronted with two different 
and in some way opposed situations. If we assume finite variance the tails 
are "too thin" and the resulting Gaussian distribution only accounts for a 
narrow neighborhood at the center of the distribution. On the other hand, the 
assumption of infinite variance leads to the Levy distribution which explains 
quite well a wider neighborhood at the center of distributions but results in 
"too fat tails". The necessity of having an intermediate model is thus clear 
and this is the main objective of the paper. 

Obviously, since the works of Mandelbrot [5] and Fama [11] on Levy dis- 
tributions, there have been several approaches to the problem, some of them 
applying cut-off procedures of the Levy distribution [12,13] and, more recently, 
the use of ARCH and GARCH models to obtain leptokurtic distributions [14]. 
The approaches based on cut-off procedures are approximations to the dis- 
tributions trying to better fit the existing data, but they are not based on a 
dynamical model that can predict their precise features. On the other hand 
ARCH [15] and GARCH [16] models are indeed dynamical adaptive models 
but they do not provide an overall picture of the market dynamics resulting in 
a distinctive probabihty distribution. In fact, ARCH/GARCH models usually 



2 



assume that the market is Gaussian with an unknown time-varying variance 
so to be self-adjusted to obtain predictions. 

The paper is organized as follows. In Sect. 2 we propose the stochastic model 
and set the mathematical framework that leads to a probability distribution 
of prices. In Sect. 3 we present the main results achieved by the model. Con- 
clusions are drawn in Sect. 4. 



2 Analysis 

Let S{t) be a random processes representing stock prices or some market 
index value. The usual hypothesis is to assume that S{t) obeys an stochastic 
differential equation of the form 

S/S = p + F{t), (1) 

where p is the instantaneous expected rate of return and F(t) is a random 
process with specified statistics, usually F{t) is zero-mean Gaussian white 
noise, F(t) = ^{t), in other words dW{t) = ^{t)dt, where W{t) is the Wiener 
process or Brownian motion. In this case, the dynamics of the market is clear 
since the return R(t) = log[S (t) / S (0)] obeys the equation R — p + C{t) which 
means that returns evolve like an overdamped Brownian particle driven by the 
"inflation rate" p and, in consequence, the return distribution is Gaussian. 

Let us take a closer look at the price formation and dynamics. Following Mer- 
ton [17] we say that the change in the stock price (or index) is basically due to 
the random arrival of new information. This mechanism is assumed to produce 
a marginal change in the price and it is modelled by the standard geometric 
Brownian motion defined above. In addition to this "normal vibration" in 
price, there is an "abnormal vibration" basically due to the (random) arrival 
of important new information that has more than a marginal effect on price. 
Merton models this mechanism as a jump process with two sources of ran- 
domness: the arrival times when jumps occurs, and the jump amplitudes. The 
result on the overall picture is that the noise source F(t) in price equation is 
now formed by the sum of two independent random components 

Fit)^m+m, (2) 

where ^{t) is Gaussian white noise corresponding to the normal vibration, 
and f{t) is "shot noise" corresponding to the abnormal vibration in price. 
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This shot noise component can be exphcitly written as 

oo 

fit)^J2^kS{t-h), (3) 

k=l 

where S{t) is the Dirac deha function, are jump amphtudcs, and are 
jump arrival times. It is also assumed that Ak and tk are independent ran- 
dom variables with known probability distributions given by h{x) and ip{t) 
respectively [18]. 

We now go beyond this description and specify the "inner components" of 
the normal vibration in price, by unifying this with Merton's abnormal com- 
ponent. We thus assume that all changes in the stock price (or index) are 
modelled by different shot-noise sources corresponding to the detailed arrival 
of information, that is, we replace the total noise F{t) by the sum 

m 

m = E m, (4) 

n=no 

where f„{t) are a set of independent shot-noise processes given by 

oo 

fn{t)= E Ak„,Jit-tkr.,n). (5) 

The amplitudes A^^^^n are independent random variables with zero mean and 
probability density function (pdf), hn{x), depending only on a single "di- 
mensional" parameter which, without loss of generality, we assume to be the 
standard deviation of jumps i-e., 

hn{x) = a-^h{xa-^). (6) 

We also assume that the occurrence of jumps is a Poisson process, in this case 
shot noises are Markovian, and the pdf for the time interval between jumps is 
exponential: 

'0(^fen,n — tkn-l,n) = Anexp[— A„(ife„_„ — ife„-l,n)], (7) 

where A„ are mean jump frequencies, i.e., 1/A„ is the mean time between two 
consecutive jumps [18]. Finally, we order the mean frequencies in a decreasing 
way: A„ < A„_i. 
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Let X{t) be the zero-mean return, i.e., X{t) = R{t) — pt. For our model X{t) 
reads 

m oo 

x{t)= y: e A,„,j{t-t,„,^), (8) 

n=no fe„=l 



where 9{t) is the Heaviside step function. Our main objective is to obtain 
an expression for the pdf of X{t), p{x,t), or equivalently, the characteristic 
function (cf) of X(t), p{uj, t), which is the Fourier transform of the pdf p(a;, t). 
Note that X{t) is a sum of independent jump processes, this allows us to 
generalize Rice's method for a single Markov shot noise to the present case of 
many shot noises [19]. The final result is 

p{uj, t) = exp I -t E -^n[l - K'^^^n)] \ ■ (9) 
I n=no J 



As it is, X{t) represents a shot noise process with mean frequency of jumps 
given hj \ — Yl Ki and jump distribution given by h{x) — ^ \nhn{x)/X. Nev- 
ertheless, we make a further approximation by assuming (i) no = — cxo, i.e., 
there is an infinite number of shot-noise sources, and (ii) there is no charac- 
teristic time scale limiting the maximum feasible value of jump frequencies, 
thus A„ — > oo as n — > — oo. Both assumptions are based on the fact that 
the "normal vibration" in price is formed by the addition of (approximately) 
infinitely many random causes, which wc have modelled as shot noises. Ac- 
cording to this, wc introduce a "coarse-grained" description and replace the 
sum in Eq. (9) by an integral 

-t J X{u)[l - h{uja{u))]du\ . (10) 



In order to proceed further we should specify a functional form for A(m) and 
a{u). We note by empirical evidence that the bigger a sudden market change 
is, the longer is the time we have to wait until we observe it. Therefore, since 
\{u) decreases with u (recall that frequencies are decreasingly ordered) then 
o"(m) must increase with u. We thus see that c^u) has to be a positive definite, 
regular and monotone increasing function for all u. The simplest choice is: 
a{u) — (Toe". On the other hand, there is empirical evidence of scaling prop- 
erties in financial data [7-10]. We summarize the above requirements {i.e., 
inverse relation between A and a, and scaling) by imposing the "dispersion 
relation" : 

A = Ao(ao/a)«. (11) 
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where a is the scahng parameter. Under these assumptions the cf of the return 
X{t) reads: 

(CTm 'j 
-Xota^ J z-^-"[l - h{ujz)]dz \ , (12) 



where am — (Joe"'" is the maximum value of the standard deviation. We observe 
that if am = oo, which means that some shot-noise source has infinite variance, 
then Eq. (12) yields the Levy distribution 

La{u;,t) — e:K.p{—ktu;°'), (13) 



where 



oo 



k = Xoa^ J ^-^-"[1 - h{z)]dz. (14) 



Hence, if we want a distribution with finite moments, we have to assume a 
finite value for am- 

Let Am be the mean frequency corresponding to the maximum (finite) variance. 
Recall that, in the discrete case {cf. Eq. (9)), shot-noise sources are ordered, 
thus Xm and am correspond to the mean frequency and the variance of the last 
jump source considered. Our last assumption is that the total number of noise 
sources in Eq. (8) increases with the observation time t and, since Uq — — oo, 
this implies that m = m{t) is an increasing function of time. Consequently, 
the mean period of the last jump source, A~^, also grows with t. The simplest 
choice is the linear relation: Xmt = a, where a > is constant. Therefore, from 
the dispersion relation, Eq. (11), we see that the maximum jump variance 
depends on time as a power law: 

al = {btf/-, (15) 



where b = a^Xo/a. We finally have 



p{uj, t) = exp < 



(bt) 



\/0L 



-abt 



z-^-^il - h{ujz)]dz 



(16) 



6 



3 Results 



Let us now present the main results and consequences of the above analysis. 
First, the volatility of the return is given by 

I — a. I — a. 



which proves that a < 2 and the volatility shows super-diffusion. The anoma- 
lous diffusion behavior of the empirical data (at least at small time scales) 
was first shown by Mantegna and Stanley without mention it [20,21]. Second, 
kurtosis is constant and given by 

(2-a)^/^(-)(0) 
^ (4 - a)a ■ ^'^^ 



Thus 72 > for all in other words, we have a leptokurtic distribution in all 
time scales. Third, the return probability distribution scales as 

p(a;,t) = (6t)-^/Xx/(6t)^/") (19) 



and the model becomes self-similar [7-10]. 

In Fig. 1 wc plot the super- diffusion behavior. Circles correspond to empirical 
data from S&P 500 cash index during the period January 1988 to Decem- 
ber 1996. Solid line shows the super-diffusive character predicted by Eq. (17) 
setting a = 1.30 and a6^/" = 2.44 x 10~® (if time is measured in minutes). 
Dashed hne represents normal-diffusion {X'^{t)) oc t. Observe that data obeys 
super- diffusion for t < 10 min, and when t > 10 min there seems to be a 
"crossover" to normal diffusion. 

We finally study the asymptotic behavior of our distribution. It can be shown 
from Eq. (12) that the center of the distribution, defined by \x\ < {btY^"', 
is again approximated by the Levy distribution defined above. On the other 
hand the tails of the distribution are solely determined by the jump pdf h{u) 
by means of the expression 

oo 

pM^j^ f u-h{u)du, (|a;| » (6t)V"). (20) 

\x\/(Jm, 



Therefore, return distributions present fat tails and have finite moments if 
jump distributions behave in the same way. This, in turn, allows us to make 
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t (min.) 

Fig. 1. Second moment of the zero- mean return. Circles correspond to empirical 
data from S&P 500 cash index (January 1988 to December 1996). Solid line shows 
the super-diffusive character predicted by Eq. (17). 

statistical hypothesis on the form of h{u) based on the empirical form and 
moments of the pdf. 

In Fig. 2 we plot the probability density p{x, t) of the S&P 500 cash index 
returns X{t) observed at time t = 1 min (circles). S = 1.87 x 10""^ is the 
standard deviation of the empirical data. Dotted line corresponds to a Gaus- 
sian density with standard deviation given by S. Solid line shows the Fourier 
inversion of Eq. (12) with a = 1.30, o"^ = 9.07 x 10"^, and a = 2.97 x 10"^. 
We use the gamma distribution of the absolute value of jump amplitudes, 

h{u) = /l^l'^-^e-'^l"! /2r(/3), (21) 



with /3 = 2.39, and = ^ l3{i3 + 1) = 2.85. Dashed line represents a sym- 
metrical Levy stable distribution of index a = 1.30 and the scale factor 
k — 4.31 X 10~^ obtained from Eq. (14). We note that the values of cr^ and E 
predict that the Pareto-Levy distribution fails to be an accurate description 
of the empirical pdf for x ^ 5E (see Eq. (20)). 

We chose a gamma distribution of jumps because (i) as suggested by the em- 
pirical data analized, the tails of p{x, t) decay exponentially, and (ii) one does 
not favor too small size jumps, i.e., those jumps with almost zero amplitudes. 
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Fig. 2. Probability density function for t = 1 min. Circles represent empirical 

data from S&P 500 cash index (January 1988 to December 1996). S is the standard 
deviation of empirical data. Dotted line corresponds to the Guassian density. Dashed 
line is the Levy distribution and the solid line is the Fourier inversion of Eq. (22) 
with a gamma distribution of jumps (see main text for details). 

In any case, it would be very useful to get some more microscopic approach 
(based, for instance, in a "many agents" model [3,22]) giving some inside on 
the particular form of h{u). 



4 Conclusions 

Summarizing, by means of a continous description of random pulses, we have 
obtained a dynamical model leading to a probability distribution for the spec- 
ulative price changes. This distribution which is given by the following char- 
acteristic function: 

p(c;,t) = exp|-a y" z'^-'^W -h{ujza^{t))\d^, (22) 

where crm(t) = (6t)^/", it depends on three positive constants: a, 6, and a < 2. 
The characteristic function (22) also depends on an unknown function h{Lo\ 
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the unit-variance characteristic function of jumps, also to be conjectured and 
fitted from the tails of the empirical distribution. Therefore, starting from 
simple and reasonable assumptions we have developed a new stochastic process 
that possesses many of the features, i. e. fat tails, self-similarity, super diffusion, 
and finite moments, of financial time series, thus providing us with a different 
point of view on the dynamics of the market. We finally point out that the 
model does not explain any correlation observed in empirical data (as some 
markets seem to have [4,23]). This insufficiency is due to the fact that we have 
modelled the behavior of returns through a mixture of independent sources 
of white noise. The extension of the model to include non-white noise sources 
and, hence, correlations will be presented soon. 
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